Distributed Policy Evaluation Under Multiple Behavior Strategies
نویسندگان
چکیده
منابع مشابه
Data-Efficient Policy Evaluation Through Behavior Policy Search
We consider the task of evaluating a policy for a Markov decision process (MDP). The standard unbiased technique for evaluating a policy is to deploy the policy and observe its performance. We show that the data collected from deploying a different policy, commonly called the behavior policy, can be used to produce unbiased estimates with lower mean squared error than this standard technique. W...
متن کاملEvaluation of Join Strategies for Distributed Mediation
Three join algorithms are evaluated in an environment with distributed main-memory based mediators and data sources. A streamed ship-out join ships bulks of tuples to a mediator near a data source, followed by post-processing in the client. An extended streamed semi-join in addition builds a main-memory hash index in the client mediator. A ship-in algorithm materializes and joins the data in th...
متن کاملHedging Strategies: Electricity Investment Decisions under Policy Uncertainty
Given uncertainty in long-term carbon reduction goals, how much non-carbon generation should be developed in the near-term? This research investigates the optimal balance between the risk of overinvesting in non-carbon sources that are ultimately not needed and the risk of underinvesting in non-carbon sources and subsequently needing to reduce carbon emissions dramatically. We employ a novel fr...
متن کاملRisk Hedging Strategies under Energy System and Climate Policy Uncertainties
The future development of the energy sector is rife with uncertainties. They concern virtually the entire energy chain, from resource extraction to conversion technologies, energy demand, and the stringency of future environmental policies. Investment decisions today need thus not only to be cost-effective from the present perspective, but have to take into account also the imputed future risks...
متن کاملProvider Behavior Under Global Budgeting and Policy Responses
Third-party payer systems are consistently associated with health care cost escalation. Taiwan's single-payer, universal coverage National Health Insurance (NHI) adopted global budgeting (GB) to achieve cost control. This study captures ophthalmologists' response to GB, specifically service volume changes and service substitution between low-revenue and high-revenue services following GB implem...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: IEEE Transactions on Automatic Control
سال: 2015
ISSN: 0018-9286,1558-2523
DOI: 10.1109/tac.2014.2368731